Update webui to handle reasoning content and include usage stats in server only when requested #791

firecoperana · 2025-09-24T00:29:20Z

The following changes are included in this PR:

Change the default value of the reasoning format to auto and make the current webui compatible with this change (Support streaming delta.reasoning_content in WebUI, ggml-org/llama.cpp#15052). This should be fine with most third-party front ends, as they should have been updated by now. If not, start the server with --reasoning-format none.
Add a config option for the reasoning format to the current webui. If you encounter parsing issues with reasoning content, switch between none and auto to see which one fixes them.
server : include usage statistics only when user request them (ggml-org/llama.cpp#16052). This could be a breaking change, but it is compatible with the OpenAI API.
server : only attempt to enable thinking if using jinja (ggml-org/llama.cpp#15967)
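
For third-party front ends adapting to these changes, here is a minimal client-side sketch. It assumes the server streams OpenAI-style chat completion chunks, that reasoning text arrives in `delta.reasoning_content` when the reasoning format is auto, and that usage is requested through the OpenAI-style `stream_options.include_usage` field. The helper names `build_chat_request` and `accumulate` are hypothetical and are not part of this PR; the exact chunk layout the server emits may differ.

```python
from dataclasses import dataclass
from typing import Iterable, Optional


def build_chat_request(messages: list, want_usage: bool = False) -> dict:
    """Build a streaming chat request body (hypothetical helper).

    Usage statistics are only asked for when want_usage is True,
    following the OpenAI-style `stream_options.include_usage` field.
    """
    payload = {"messages": messages, "stream": True}
    if want_usage:
        payload["stream_options"] = {"include_usage": True}
    return payload


@dataclass
class StreamResult:
    reasoning: str = ""            # text collected from delta.reasoning_content
    content: str = ""              # text collected from delta.content
    usage: Optional[dict] = None   # stays None unless the server sends usage


def accumulate(chunks: Iterable[dict]) -> StreamResult:
    """Fold already-decoded stream chunks into reasoning, content, and usage.

    Assumption: each chunk is a dict shaped like an OpenAI chat completion
    chunk; missing or null fields are treated as empty.
    """
    result = StreamResult()
    for chunk in chunks:
        if chunk.get("usage"):
            result.usage = chunk["usage"]
        for choice in chunk.get("choices") or []:
            delta = choice.get("delta") or {}
            result.reasoning += delta.get("reasoning_content") or ""
            result.content += delta.get("content") or ""
    return result
```

Keeping reasoning and content in separate fields means the front end does not have to parse thinking tags out of the content stream; if the server is started with --reasoning-format none, `reasoning` would simply stay empty in this sketch.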

server : include usage statistics only when user request them (#16052) server : only attempt to enable thinking if using jinja (#15967)

handle reasoning content in webui

f8ed961


firecoperana requested a review from ikawrakow September 24, 2025 00:29

firecoperana self-assigned this Sep 24, 2025

config reasoning_content in webui and change default to auto

e5aa602

firecoperana force-pushed the fcp/webui_reasoning_handle branch from cb93683 to e5aa602 Compare September 24, 2025 00:38

firecoperana mentioned this pull request Sep 24, 2025

Bug: stream response without <think> token #776

Closed

ikawrakow approved these changes Sep 24, 2025


ikawrakow merged commit 09db3a4 into main Sep 24, 2025

firecoperana deleted the fcp/webui_reasoning_handle branch October 26, 2025 16:57
